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Abstract 

Entropic arguments are shown to play a central role in the foundations 
of quantum theory. We prové that probabilities are given by the modulus 
squared of wave functions, and that the time evolution of states is linear 
and also unitary. 

1 Introduction 

One of the curious features of quantum mechanics is that it is a theory in which 
probabilities play a most central role and yet, from a foundational point of 
view, the concept of entropy is conspicuously absent. Entropy appears only 
later as an auxiliary quantity to be used only when a problem is sufficiently 
complicated that clean deductive methods have failed and one is forced to use 
dirticr inference methods. This is curious indeed because once the use of the 
notion of probability has been accepted, the issue of whether or not quantum 
mechanics is a theory of inference has been unequivocally settled. Quantum 
theory should be regarded as a set of rules for reasoning in situations where 
even under optimal conditions the information available to predict the outeome 
of an experiment may still turn out to be insufhcient. In such a theory entropy, 
as the measure of the amount of information should play a central role. 

It is difficult to avoid the fecling that perhaps the use of entropic arguments has 
been inadvertently encoded into the usual postulates of quantum mechanics. 
The purpose of this paper is to show that this is in fact the case. 

This paper is a continuation of previous work in which quantum the- 
ory is formulated as the only consistent way to manipulate the amplitudes for 
quantum processes. The result of this consistent-amplitude approach is the 
Standard quantum theory Q , in a form that is very close to Feynman's || . The 
first and most crucial step is a decision about the subject matter. We choose a 
pragmàtic, operational approach: statements about a system are identified with 
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those experimental setups designed to test them; the objective is to predict the 
outcomes of experiments j?J . The observation that if two setups are related in 
some way then information about one may be relevant to predictions about the 
other leads one to identify the possible relations among setups and to introduce 
the means for handling these relations quantitatively. This is the role played 
by the amplitudes. Amplitudes are tools for reasoning that encode information 
about how one builds complicated setups by combining more elementary ones. 
The question of how amplitudes are used to predict the outcomes of experiments 
is addressed through a single general interpretative rule. A brief summary is 
given in section 2. 

It is quite remarkable that although the interpretative rule does not in itsclf 
involve probabilities it can be used to prové the Born statistical "postulate" (or, 
better, Born "rule") || provided one extra ingredient is added. The point is 
that the application of the interpretative rule requires a criterion to quantify 
the change in amplitudes when setups are modified. In ref. the criterion 
adopted was to use the Hilbert norm as the means to measure the distance 
between wave functions. This may seem reasonable but such a very technical 
assumption without an obvious physical basis clearly detracts from the beauty 
of the argument. In section 3 this blemish is corrected. The realization that the 
components out of which setups are built, the filters, already supply us with a 
notion of orthogonality brings us very close to the inner product needed for the 
Hilbert norm. We close this remaining gap using a symmetry argument. The 
implication is very interesting: the opinion that quantum probabilities diffcr 
in an essential way from ordinary classical probabilities is very widespread; not 
only are they calculated using apparently different rules but, being given a priori 
by the Born rule, they also seem to depend less on information, to be more 
"objective" . Our proof of the Born rule, tracing it to a form of the principle 
of insuficient reason, supports the opposite point of view, that such differences 
are an illusion, that there is only one kind of probability. 

The constraint that amplitudes be assigned consistently leads to a time evo- 
lution that is linear || but remains silent about whether it should also be uni- 
tary. A common explanation is that imposing unitary time evolution guarantees 
that probabilities be conserved. This is true but it is also irrelevant; that proba- 
bilities should add up to one is a matter of definition |9) ; the "non-conservation 
of probabilities" can always be "fixed" by a suitable reinterpretation. The usual 
symmetry arguments based on Wigner's theorem are also inadequate. Why 
should time evolution be a symmetry in the very technical sense of preserv- 
ing inner produets? Finally, arguments based on von Neumann's entropy are 
circular |]ïo| , they implicitly assume that one can measure observables other 
than position [pT) , an assumption which itself (see section 5) relies on unitary 
evolution. 

The argument we offer in section 4 is based on the idea of array entropy, a 
concept that was bricfiy introduced by Jaynes JÏ2| only to be dismissed as an 
inadequate candidate for the entropy of a quantum system, a quantity which 
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hc rightfully idcntified with von Neumann's entropy. From our point of view, 
however, amplitudes and wave functions are assigned not just to the system 
but to the whole experimental setup, and this turns the array entropy into a 
very useful notion. The idea is simple. In situations where the information 
available for the prediction of experimental outcomes is not spoiled by just 
waiting entropy should be conserved. Our problem is to identify the appropriate 
entropy (it is the array entropy). Its conservation implies the conservation of 
the Hilbert norm and unitary evolution. As claimed above, the postulate that 
time evolution is unitary is derivable from an entropic argument. 

2 The consistent-amplitude approach to quan- 
tum theory 

The objective of quantum theory is to predict the outcomes of experiments; 
statements about the quantum system are identified with the experimental se- 
tups designed to test them M . To avoid irrelevant technical distractions we 
consider a very simple system, a particle that lives on a discrete lattice and 
has no spin or other infernal structure. The generalization to more complex 
configuration spaces should be straightforward. 

The simplest experimental setup, denoted by [xf,Xi], consists of placing a 
source that prepares the particle at a space-time point Xi — (xí,U) and placing 
a detector at Xf = (xf,tf). To test a more complex statement such as "the 
particle goes from xí to x\ and from there to Xf" denoted by [xf,xi,Xi], requires 
a more complex setup involving an idealized device, a "filter" which prevents 
any motion from Xi to Xf except via the intermediate point x±. This filter is 
some sort of obstacle or screen that exists only at time t\, blocking the particle 
everywhere in space except for a small "hole" around x%. The possibility of 
introducing many filters each with many holes leads to allowed setups of the 
general form a — [xj, s^, sjv-i, ■ ■ ■ , «27 s%, Xi] where s n = (x ni x' n , x", . . .) is a 
filter at time t n , intermediate between ti and í/, with holes at x n , x^, x'^, . . . 

There are two bàsic kinds of relations among setups. The first, called and, 
arises when two setups a and b are placed in immediate succession resulting in 
a third setup which we denote by ab. It is necessary that the destination point 
of the earlier setup coincide with the source point of the later one, otherwise 
the combined ab is not allowed. The second relation, called or, arises from the 
possibility of opening additional holes in any given filter. Specifically, when 
(and only when) two setups a' and a" are identical except on one single filter 
where none of the holes of a! overlap any of the holes of a" , then we may form 
a third setup a, denoted by a 1 V a", which includes the holes of both a! and 
a" . Provided the relevant setups are allowed the bàsic properties of and and 
or are quite obvious: or is commutative, but and is not; both and and or are 
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associative, and finally, and distributes over orí^j 

A quantitative representation of and/or is obtained by assigning a singlo 
complex number </>(a) to each setup a in such a way that relations among setups 
translate into relations among the corresponding complex numbers. What gives 
the theory its robustness, its uniqueness, is the requirement that the assignment 
be consistent: if there are two different ways to compute 4>(a) the two answers 
must agree. The remarkable consequence of the consistency constraints is the 
possibility of regraduating <f>(a) with a function ip to switch to an equivalent and 
particularly convenient representation, ip(a) = if)((f>(a)), in which and and or are 
respectively represented by multiplication and addition. Explicitly, tp (ab) = 
t/j (a) tp (b) and ^(aVa') = tp (a) + tp (a'). Complex numbers assigned in this 
way are called "amplitudes" . For a similar (earlier) derivation of the quantum 
sum and product rules see ref. 

The observation that a single filter that is totally covered with holes is 
equivalent to having no filter at all leads to the fundamental equation of mo- 
tion. The idea is expressed by writing the relation among setups jxf, Xi] = 
Vall xatt([ x /' ^tJI^ti X A) m terms of the corresponding amplitudes ||. Using 
the sum and product rules, we get tp(xf,Xi) = J2aiixa.tt V^/i x t) Í>(xt, Xi). 

Following Feynman we introduce the wave function ^(x, t) as the means 
to represent those features of the setup prior to i that are relevant to time 
evolution after t. Notice that there are many possible combinations of starting 
points Xi and of interactions prior to the time t that will result in identical 
evolution after time t. What these different possibilities have in common is that 
they all lead to the same numerical value for the amplitude ip(x t , x{). Therefore 
we set ^(x, t) — ip(xt,Xi) and all reference to the irrelevant starting point Xi 
can be omitted. The traditional language is that describes the state of the 
particle at time t, that the effect of the interactions was to prepare the particle 
in state Now we see that the word "state" just refers to a concise means of 
encoding information about those aspects of the setup prior to the time t that 
are relevant for evolution into the future. 

The equation of motion can then be written as 

9(S f ,tf)= Í>(xf,tf,x,t)V{£,t), (1) 

all x at t 

which is equivalent to a linear Schròdinger equation as can easily be seen || 
by differentiating with respect to tf and evaluating at tt =t. 

This result is important because a variety of nonlinear modifications of quan- 
tum mechanics have been proposed, either attempting to solve the problems 

1 These are physical rather than logical connectives. JThey represent our idealized ability 
to construct more complex setups out of simpler ones W and they differ substantially from 
their Boolean and quantum lògic counterparts. In Boolean lògic not only and distributes 
over or but or also distributes over and while in quantum lògic propositions refer to quantum 
properties at one time rather than to processes in time. 
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with macroscopic quantum supcrpositions, or to explore the possibility that the 
linear theory might just be a low "intensity" limit of the true nonlinear theory 
[ p"4| . Furthcrmore, even though experimental bounds have become increasingly 
stringent [ |Ï5| experimentation alone cannot logically rule out small nonlineari- 
ties. 

The question of how amplitudes or wave functions are used to predict the 
outcomes of experiments is addressed through the time evolution equation ([ï]). 
For example, suppose the preparation procedure is such that ^(x,t) vanishes 
at a certain point xq. Then, according to eq. (0), placing an obstacle at the 
single point (xq, t) (i.e., placing a filter at í with holes everywhere except at Xo) 
should have no effect on the subsequent evolution of ^. Since relations among 
amplitudes are meant to reflect corresponding relations among setups, it seems 
natural to assume that the presence or absence of the filter will have no effect 
on whether detection at Xf occurs or not. Therefore when ty(xo,t) = the 
particle will not be detected at (xo,t). 

This idea can be generalized to the following general interpretative rule: 
Suppose the wave function of a setup is \I/ (í) and at time í one introduces 
or removes a filter that blocks out those components of ^ characterized by a 
certain property V. Suppose further that this modification of the setup has a 
negligible effect on the evolution of 4" after í. Then when the wave function is 
í (í) property V will not be detected. 

In ref. || we showed how this interpretative rule implies the Born statistical 
postulate provided one uses the Hilbert norm as the means to quantify the 
change in the wave function as it evolves through a filter. In the next section 
we show why the choice of the Hilbert norm is the natural one. 

3 The Hilbert inner product 

In order to justify the use of the Hilbert norm we show how the concepts of 
distance and angle among states, that is an inner product, can be physically 
motivated. The argument has three parts. 

First, we note that wave functions form a linear space. To illustrate this point 
suppose that í'l (x, t) = ijj(x, í; X\ , t ) is the wave function at time í of a particle 
that at time ío was prepared at the point X\, and ^ 2 (x,t) = i)(x, í; x 2 , to) is 
the wave function at time í of a particle that at time í was prepared at the 
point X2- One way to prepare linear superpositions of ^f\(x, í) and ^2(x,t) is by 
placing the source at an initial point (xj,íj) with ti earlier than ío an d letting 
the particle evolve through a filter at ío with holes at x\ and x*2- Then the 
amplitude ip(x,t]Xi,U) is 

ip(x,t]Xi,U) = ip(x,t;xi,t )ip(xi,t ;xi,ti) + ip(x,t;xi,t )ip(xi ,t ;xi,ti) , (2) 

and, in an obvious notation, the wave function at time í is given by the superpo- 
sition ^(x, í) = a^i(x, í) + (3'í>2(x, t). Notice that the complex numbers a and 
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(3 can be changed at will by changing the starting point (xí, U) or by modifying 
the setup between U and to in any arbitrary way. 

The second part of the argument is to point out that the bàsic components 
of setups, the filters, already supply us, without any additional assumptions, 
with a concept of orthogonality. The action of a filter P at time t with holes 
at a set of points x p is to turn the wave function ty(x) into the wave function 
pqf(x) = J2 P àx,x p ^(x), an d since filters P act as projectors, P 2 = P, any given 
filter defines two special classes of wave functions. One is the subspace of those 
wave functions such as Ü/p = PÜ> that are unaffected by the filter, PÜ/p = Ü/p. 
The other is the subspace of those that are totally blocked by the filter, such as 
Ü/i_p = (1 — P)Ü/, for which PÜ/i_p = 0. We will say that these two subspaces 
are orthogonal to each other. 

Any wave function can be decomposed into orthogonal components, Ü/ = 
Ü/p + Ü/i_p. A particularly convenient expansion in orthogonal components is 
that defined by a complete set of elementary filters. A filter Pj is elementary if 
it has a single hole at £4, it acts by multiplying Ü/(:r) by fe,^; the set is complete 
if Pi = 1- Then = A t 5 s , Si , where A t = Ü>(xí). 

In the third and last step of our argument, as a matter of convenience, we 
switch to the familiar Dirac notation. Instead of writing Ü/(x) and we 
shall write |Ü>) and \i), so that |Ü/) = Ei-^-il*)- The question is what else, in 
addition to the notion of orthogonality described above, is needed to determine 
a unique inner product. Recali that an inner product satisfies three conditions: 
(a) (Ü/|Ü>) ^ with (ü/|Ü>) = if and only if |Ü/) = 0, (b) linearity in the second 
factor ($|aiü>i + = «í^l^i) + ce 2 (^\^2), and (c) antilinearity in the 

first factor, ($|Ü/) = (Ü/|Ü>)*. Conditions (b) and (c) determine the product 
of the state |Ü>) = Bj\j) with |Ü/) = Ej^'K) in terms of the product of 
\j) with \i), ($|Ü/) = z2i B1- Ai(j\i) . The orthogonality of the basis functions 
5&,éi is encoded into the inner product by setting = for i ^ j, but the 
case i = j remains undetermined, constrained only by condition (a) to be real 
and positive. Clearly an additional ingredient is needed. What could be more 
natural than the symmetry argument that if space itself is homogeneous then 
there is a priori no reason to favor one location over another? We therefore 
choose equal to a constant which, without losing generality, we set equal 
to one. Thus the principle of insufficient reason enters quantum theory through 
the inner product, — =>■ ($|Ü/) = J2i B*A i: and this leads to the Hilbert 

norm ||*|| 2 = Ei |Af- 

One should emphasize that the symmetry argument invoked here differs from 
the usual symmetry arguments leading to conservation laws through Noether's 
theorem. The latter depends strongly on the particular form of the Hamiltonian; 
the former does not. 

The deduction of the Born statistical rule now proceeds as in ref . j| . Briefly 
the idea is as follows. We want to predict the outcome of an experiment in which 
a detector is placed at a certain Xk when the system is in state |Ü/} = J^i 
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In M we showed that the state for an cnsemble of N identically prepared, 
independent replicas of our particle is the product \^n) — Yía=i ^q}- Now 
we apply the interpretative rule. Suppose that in the iV-particle configuration 
space we place a special filter, denoted by Pf e , the action of which is to block 
all components of \^n) except those for which the fraction n/N of replicas at 
Xk lies in the range from f — e to f + e. The diffcrence between the states 
PÍJ&n) and I^n) is measured by the relative Hilbert distance, ||Pj e |í'jv} - 
I*Jv)H 2 /(*Jv|Í'jv}- The result of this calculation is f§ 

hm \\PÏ e V N -y N \\ 2 = 1- [ I+£ ó{f -\A k \ 2 )df , (3) 

N-+oc J ' J f _ £ 

where we have normalized ('í'I'í') = ('í'nI'&n) = 1- We see that for large N the 
filter Pj E has a negligible effect on the statc \^n) provided / lies in a range 2e 
about \Ak\ 2 . Therefore the state \^n) does not contain any fractions outside 
this range. On choosing stricter filters with e — > we conclude that detection 
at Xk will certainly occur for a fraction \A^\ 2 and that it will not occur for a 
fraction 1 — | Ak \ 2 . For any one of the ídentícal individual replicas there is no such 
certainty; the best one can do is to say that detection will occur with a certain 
probability Pr(fc). In order to be consistent with the law of large numbers the 
assigned value must agree with the Born rule, 

Pr(fc) = |A fe | 2 . (4) 

Had we weighted the |i)'s differently and chosen a different normalization 
= Wi, the probability would be given by Pr(i) = iUj|j4j| rather than by 
eq.(Q). It is instructive to explore this issue further particularly in the continuum 
limit. Let us weight each celi of the discrete lattice by its own volume, call 
it gj^Ax, and let Ax — > dx. Replacing w^ l \i) = (g 1 / 2 Ax)' 1 ^) by\x) the 
completeness condition 1 = J^-Pj = X^i w ï 1 1*) (*l becomes 1 = J g x l 2 dx \x)(x\. 
Next, replace Sij/Ax by 5{x — x') and the inner product = WiSij becomes 
{x\x') — g~ 1 / 2 8(x~x'). Finally, replace Ai by A(x) and the state I*) = J2i ^íN) 
becomes |^) = J g 1 / 2 dx A(x) \x). The Born rule, eq. (^), becomes Pr(cfe) = 
g x / 2 dx \ A{x)\ 2 ; \A{x)\ 2 is the probability density. These results apply both to 
situations in which the choice of coordinates is such that the homogcncity of 
space is not obvious, and also to intrinsically inhomogeneous, curved spaces. 

It is somctimes argued that while there is an element of subjectivity in the 
nature of classical probabilities that quantum probabilities are different, that 
they are totally objective because they are given by \A\ 2 . We have just shown 
that this assignment is neither more nor less subjective than say, assigning 
probabilities to each face of a die. Just like one assigns probability 1/6 when 
there is no reason to favor one face of a die over another, the Born rule follows, 
even in curved spaces, from giving the same a priori weight, the same preference, 
to spatial volume elements that are equal. (Perhaps it should be the other way 
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around: equally preferred spatial regions are assigned equal volumes. This would 
explain what a physical volume is: just a measure of a priori preference.) We 
have thus uncovered an interesting connection between quantum theory and 
the geometry of space. The full implications of this connection remain to be 
explored. 

4 Array entropy and unitary time evolution 

When we know everything that is relevant about the experimental setup prior 
to time t — we know ^(x, 0); this situation is one of optimal information. 
But if less information is available perhaps the best we can do is conclude 
that the actual preparation procedure is one among several possibilities a = 
1, 2, 3, ... each one with probability p a . (For simplicity we initially assume these 
possibilities form a discrete set.) The usual linguistic trap is to say the system is 
in state ^f a (x, 0) with probability p a , but it is better to say that the preparation 
procedure is ^ a (x, 0) with probability p a . To this state of knowledge, which one 
may represent as a set of weighted points in Hilbert space, and which Jaynes 
referred to as an arrayf] Q, one may associate the entropy 



Jaynes' objection to using this quantity as the entropy of the quantum system 
is that if the ^f a (x,0) are not orthogonal then the p a are not the probabilitics 
of mutually exclusive events. When regarded as a property or an attributc 
of the quantum system the various ^ a (x,0) need not, in fact, be mutually 
exclusive; if (\í r Q ,|\I f i g) ^ 0, knowing that the system is in ^ a (x,Q) does not 
exclude the possibility that it will be found in $ p(x, 0). However, if the $ a (f, 0) 
are attributes of the preparation procedure then they are mutually exclusive 
because the preparation devices are macroscopic! Sa is the entropy of the 
preparation procedure not the entropy of the quantum system. 

The importance of this conceptual point cannot be overemphasized and a 
more explicit illustration may clarify it further. Consider a spin 1/2 particle 
prepared either with spin along the z direction or with spin along the x direction. 
These states are not orthogonal and by looking at the particle there is no sure 
way to teli which of the two alternatives holds, and yet the slightest glimpsc 
at the Stern-Gerlach magnets will reveal which of the two mutually exclusive 
orientations was used. One can distinguish non-orthogonal states by looking 
at the devices that prepared the system rather than by looking at the system 



Turning to the issue of time evolution, we consider situations where those 
parts of the setup after time are known and no further uncertainty is intro- 

2 If the states are normalized the points of the array lie on the surface of a unit sphere, but 
normalization is not necessary for our argument. 




(5) 



itself. 
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duced. Under these conditions the points of the new array are shifted from 
ty a {x, 0) to ^ a {x, t) but their probabilities p a and the corresponding array en- 
tropy Sa remain unchanged. 

So far our uncertainty about the preparation procedure was of a rather 
simple nature, it led to a probability distribution defined over a discrete array. 
But in general there is no such restriction and we may deal with a continuous 
array. The simplest continuous array is one dimensional, a weighted curve C 
in Hilbert space. We could consider higher dimensional arrays but this would 
unnecessarily obscure the argument. The reparametrization-invariant entropy 
of this continuous array is (ï(| 



where p(a)da is the probability that the preparation procedure lies in the 
interval between a and a + da and £(a)da is a measure of the distance in 
Hilbert space between *S> a (x, 0) and ^ a+ d a (x, 0). As discussed in the last sec- 
tion the Hilbert norm is the uniquely natural choice of distance, thus £(a)da = 



The possibility of continuous arrays adds a new twist to our considerations 
about time evolution. Again we consider setups for which no further uncertainty 
is introduced between times and t. We find that points ^! a {x, 0) of the old 
line array at t = will move to points ^ a (x, t) to form a new line array at 
time t. Since no information was lost between times and t we expect that, 
just as in the discrete case, the probabilities p(a)da remain unchanged and the 
corresponding array entropy Sa is conserved. But entropy conservation, 



should hold for any curve C and any function p(a), therefore d£(a)/dt = 0. 
Thus the conservation of the array entropy leads to the conservation of Hilbert 
space distances. Time evolution must be unitary; the Hamiltonian must be 
Hermitian. 

5 Observables other than position 

We have only discussed the measurement of position. How about other ob- 
servables, uncertainty relations, and so many other notions that are central in 
Standard quantum theory? Our brief answer (a more detailed discussion will 
appear in [ïq] ) is that other observables are useful concepts in that they facili- 
tate the description of complex experiments but, from our point of view, they 
are of only secondary importancc and play no role at the foundational level. 

One can effectively build more complex detectors by modifying the setup (by 
introducing, e.g., magnètic fields or diffraction gratings) just prior to the final 





Vat+da) - |*«)]]- 





position detection at Xf. The skill of an experimentalist consists of arranging 
the interactions between time t and the time of detection tf in such a way 
that each state $> n (x,t) of an orthogonal set evolves to a corresponding state 
4> n (x,tf) = S^ t s n which also form an orthogonal set. Then, when the particle 
is finally detected at time tf we say that "at time tf the particle was found 
at x n " or alternatively, we convey the same information by saying, somewhat 
inappropriately, that "the particle was found to be in state $„ (x, t) at time í" . 

What this particular complex detector "measures" is all observables of the 
form Q = J2 n f n \®n)(&n\- It is noteworthy that the eigenvalues /„ need not 
be real; the observables Q are diagonalizable, i.e., normal = 0), but 

not necessarily Hermitian. Clearly, this notion of observables other than posi- 
tion can only be introduced after one understands that time evolution must be 
unitary. 

6 Final remarks 

For over a century now an enormous effort has been directed towards deriving 
the second law of thermodynamics from the laws of mechanics. The successive 
realization by Gibbs, by Shannon |Q, and even more so by Jaynes [^) that the 
validity of entropic arguments rests on elements that are foreign to mechanics 
opens the way to inverting the lògic and deriving the laws of mechanics from 
those same principies of inference which lie at the heart of thermodynamics. In 
fact, according to Jaynes' beautiful explanation |f7j, the validity of the second 
law hinges on the conservation of Gibbs' or von Neumann's entropies. Could 
this conservation also be used to deduce unitary time evolution? No. Such 
arguments would be circular because these entropies rely for their very definition 
on having singled out certain measures (phase space volumes, and Hilbert norms 
respectively) as being privileged and the only reason they are special is precisely 
that they are conserved under unitary time evolution. 

In this work, however, we have given an argument that singles out the Hilbert 
norm without appealing to unitarity; this clears the road to defining an entropy, 
the array entropy, from the conservation of which one can deduce the unitarity 
of time evolution. 

The mystery of why complex numbers are sufficient to encode information 
about relations between setups remains. It seems that one could use other math- 
ematical objects with the required associativity and distributivity, for example 
matrices or other Clifford numbers JÏ9| . The recent work by Rodríguez [^0| 
may contain important steps in exploring this possibility from a rather different 
point of view. My own belief is that the connection between the quantum inner 
product and spatial measures of volume strongly suggests that the reason for 
complex numbers will be found in the geometry of space. Perhaps eventually 
even the geometry of space itself will be determined by entropic considerations 
as well. 
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